Abstract. Systems of equations with sets of integers as unknowns are considered. It is
shown that the class of sets representable by unique solutions of equations using the
operations of union and addition $S + T = \{m + n \mid m \in S, n \in T\}$ and with ultimately periodic
constants is exactly the class of hyper-arithmetical sets. Equations using addition only can
represent every hyper-arithmetical set under a simple encoding. All hyper-arithmetical sets
can also be represented by equations over sets of natural numbers equipped with union,
addition and subtraction $S \dot{-} T = \{m - n \mid m \in S, n \in T, m \geq n\}$. Testing whether a given
system has a solution is $\Sigma^1_1$-complete for each model. These results, in particular, settle
the expressive power of the most general types of language equations, as well as equations
over subsets of free groups.
1. Introduction

Language equations are equations with formal languages as unknowns. The simplest
such equations are the context-free grammars [1], as well as their generalization, the
conjunctive grammars [15]. Many other types of language equations have been studied in
recent years, see a survey by Kunc [11], and most of them were found to have strong
connections to computability. In particular, for equations with concatenation and Boolean
operations it was shown by Okhotin [19] that the class of languages representable by
their unique (least, greatest) solutions is exactly the class of recursive (r.e., co-r.e.) sets.
A computationally universal equation of the simplest form was constructed by Kunc [10],
who proved that the greatest solution of the equation $XL = LX$, where $L \subseteq \{a,b\}^*$ is a
finite constant language, may be co-r.e.-complete.

A seemingly trivial case of language equations over a unary alphabet $\Omega = \{a\}$ has
recently been studied. Strings over such an alphabet may be regarded as natural numbers,
and languages accordingly become sets of numbers. As established by the authors [8], these
equations are as powerful as language equations over a general alphabet: a set of natural
numbers is representable by a unique solution of a system with union and elementwise
addition if and only if it is recursive. Furthermore, even without the union operation
these equations remain almost as powerful [9]: for every recursive set $S \subseteq \mathbb{N}$, its encoding
$\sigma(S) \subseteq \mathbb{N}$ satisfying $S = \{n \mid 16n + 13 \in \sigma(S)\}$ can be represented by a unique solution of a
system using addition only, as well as ultimately periodic constants. At the same time, as
shown by Lehtinen and Okhotin [12], some recursive sets are not representable without an
encoding.

Equations over sets of numbers are, on the one hand, interesting in their own right as a basic
mathematical object. On the other hand, these equations form a very special case of
language equations with concatenation and Boolean operations, which turned out to be
as hard as the general case, and this is essential for understanding language equations.
However, it must be noted that these cases do not exhaust all possible language equations.
The recursive upper bound on unique solutions [19] is applicable only to equations with
continuous operations on languages, and using the simplest non-continuous operations,
such as homomorphisms or quotient [18], leads out of the class of recursive languages. In
particular, a quotient with regular constants was used to represent all sets in the arithmetical
hierarchy [18].

The task is to find a natural limit of the expressive power of language equations, which
would not assume continuity of operations. As long as operations on languages are
expressible in first-order arithmetic (which is true for every common operation), it is not hard to
see that unique solutions of equations with these operations always belong to the family of
hyper-arithmetical sets [14, 20, 21]. This paper shows that this obvious upper bound is in
fact reached already in the case of a unary alphabet.

To demonstrate this, two abstract models dealing with sets of numbers shall be
introduced. The first model is equations over sets of natural numbers with addition
$S + T = \{m + n \mid m \in S, n \in T\}$ and subtraction $S \dot{-} T = \{m - n \mid m \in S, n \in T, m \geq n\}$
(corresponding to concatenation and quotient of unary languages), as well as set-theoretic
union. The other model has sets of integers, including negative numbers, as unknowns, and
the allowed operations are addition and union. The main result of this paper is that unique
solutions of systems of either kind can represent every hyper-arithmetical set of numbers.

The base of the construction is the authors' earlier result [8] on representing every
recursive set by equations over sets of natural numbers with union and addition. In
Section 2 this result is adapted to the new models introduced in this paper. The next task
is representing every set in the arithmetical hierarchy, which is achieved in Section 3 by
simulating existential and universal quantifiers over a recursive set. These arithmetical
sets are then used in Section 4 as constants for the construction of equations representing
hyper-arithmetical sets. Finally, the constructed equations are encoded in Section 5 using
equations over sets of integers with addition only and periodic constant sets.

This result brings to mind a study by Robinson [20], who considered equations, in which 
the unknowns are functions from N to N, the only constant is the successor function and 
the only operation is superposition, and proved that a function is representable by a unique 
solution of such an equation if and only if it is hyper-arithmetical. Though these equations 
deal with objects different from sets of numbers, there is one essential thing in common: in 
both results, unique solutions of equations over second-order arithmetical objects represent 
hyper-arithmetical sets.

Some more related work can be mentioned. Halpern [5] studied the decision problem of
whether a formula of Presburger arithmetic with set variables is true for all values of these
set variables, and showed that it is $\Pi^1_1$-complete. The equations studied in this paper can
be regarded as a small fragment of Presburger arithmetic with set variables.

Another relevant model is that of languages over free groups, which have been investigated,
in particular, by Anisimov [3] and by d'Alessandro and Sakarovitch [2]. Equations over sets
of integers are essentially equations for languages over a monogenic free group.

An important special case of equations over sets of numbers are expressions and circuits
over sets of numbers, which are equations without iterated dependencies. Expressions and
circuits over sets of natural numbers were studied by McKenzie and Wagner [13], and a
variant of these models defined over sets of integers was investigated by Travers [22].

2. Equations and their basic expressive power 

The subject of this paper are systems of equations of the form

$\varphi_1(X_1, \ldots, X_n) = \psi_1(X_1, \ldots, X_n)$
$\vdots$
$\varphi_m(X_1, \ldots, X_n) = \psi_m(X_1, \ldots, X_n)$

where $X_i \subseteq \mathbb{Z}$ are unknown sets of integers, and the expressions $\varphi_i$ and $\psi_i$ use such
operations as union, intersection, complementation, as well as the main arithmetical operation
of elementwise addition of sets, defined as $S + T = \{m + n \mid m \in S, n \in T\}$. Subtraction
$S - T = \{m - n \mid m \in S, n \in T\}$ shall be occasionally used. The constant sets contained
in a system sometimes will be singletons only, sometimes any ultimately periodic constants
will be allowed (a set of integers $S \subseteq \mathbb{Z}$ is ultimately periodic if there exist numbers $d \geq 0$
and $p \geq 1$, such that $n \in S$ if and only if $n + p \in S$ for all $n$ with $|n| \geq d$), and in
some cases the constants will be drawn from wider classes of sets, such as all recursive sets.
Systems over sets of natural numbers shall have subsets of $\mathbb{N}$ both as unknowns and as
constants; whenever subtraction is used in such equations, it will be used in the
form $S \dot{-} T = (S - T) \cap \mathbb{N}$.
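
To make these operations concrete, the following minimal Python sketch evaluates them on finite sets of integers. The names `add`, `sub`, `monus` and `looks_ultimately_periodic` are illustrative assumptions and do not come from the paper; finite sets merely stand in for arbitrary ones, and the periodicity test inspects only a bounded range, so it gives evidence rather than a proof.

```python
def add(S, T):
    """Elementwise sum S + T = {m + n | m in S, n in T}."""
    return {m + n for m in S for n in T}


def sub(S, T):
    """Elementwise difference S - T = {m - n | m in S, n in T} over the integers."""
    return {m - n for m in S for n in T}


def monus(S, T):
    """Subtraction for sets of natural numbers: (S - T) intersected with N."""
    return {k for k in sub(S, T) if k >= 0}


def looks_ultimately_periodic(member, d, p, bound):
    """Check 'n in S iff n + p in S' for all n with d <= |n| <= bound.

    `member` is a membership predicate for S. Only finitely many n are
    inspected, so a True result is evidence, not a proof.
    """
    return all(member(n) == member(n + p)
               for n in range(-bound, bound + 1) if abs(n) >= d)
```

For instance, the set $\{1\} \cup \{6, 8, 10, \ldots\}$ passes the bounded test with $d = 5$ and $p = 2$, but not with $d = 0$.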

Consider systems with a unique solution. Every such system can be regarded as a 
specification of a set, and for every type of systems there is a natural question of what kind 
of sets can be represented by unique solutions of these systems. For equations over sets of 
natural numbers, these are the recursive sets: 

Proposition 1 (Jeż, Okhotin [8, Thm. 4]). The family of sets of natural numbers
representable by unique solutions of systems of equations of the form $\varphi_i(X_1, \ldots, X_n) =
\psi_i(X_1, \ldots, X_n)$ with union, addition and singleton constants, is exactly the family of
recursive sets.

Turning to the more general cases of equations over sets of integers and of equations 
over sets of natural numbers with subtraction, an upper bound on their expressive power 
can be obtained by reformulating a given system in the notation of first-order arithmetic. 

Lemma 1. For every system of equations in variables $X_1, \ldots, X_n$ using operations
expressible in first-order arithmetic there exists an arithmetical formula $Eq(X_1, \ldots, X_n)$, where
$X_1, \ldots, X_n$ are free second-order variables, such that $Eq(S_1, \ldots, S_n)$ is true if and only if
$X_i = S_i$ is a solution of the system.

Constructing this formula is only a matter of reformulation. As an example, an equation
$X_i = X_j + X_k$ is represented by
$(\forall n)\,[n \in X_i \leftrightarrow (\exists n')(\exists n'')\; n = n' + n'' \wedge n' \in X_j \wedge n'' \in X_k]$.
Now consider the following formulae of second-order arithmetic:

$\varphi(x) = (\exists X_1) \ldots (\exists X_n)\; Eq(X_1, \ldots, X_n) \wedge x \in X_1$

$\varphi'(x) = (\forall X_1) \ldots (\forall X_n)\; Eq(X_1, \ldots, X_n) \rightarrow x \in X_1$

The formula $\varphi(x)$ states that $x$ belongs to the first component of some solution of the system, while
$\varphi'(x)$ states that the first component of every solution of the system contains $x$. Since, by assumption, the system
has a unique solution, these two formulae are equivalent and each of them specifies the
first component of this solution. Furthermore, $\varphi$ and $\varphi'$ belong to the classes $\Sigma^1_1$ and $\Pi^1_1$,
respectively, and accordingly the solution belongs to the class $\Delta^1_1 = \Sigma^1_1 \cap \Pi^1_1$, known as the
class of hyper-arithmetical sets [14, 21].

Lemma 2. For every system of equations in variables $X_1, \ldots, X_n$ using operations and
constants expressible in first-order arithmetic that has a unique solution $X_i = S_i$, the sets
$S_i$ are hyper-arithmetical.

Though this looks like a very rough upper bound, this paper actually establishes the 
converse, that is, that every hyper-arithmetical set is representable by a unique solution of 
such equations. The result shall apply to equations of two kinds: over sets of integers with 
union and addition, and over sets of natural numbers with union, addition and subtraction. 
In order to establish the properties of both families of equations within a single construction, 
the next lemma introduces a general form of systems that can be converted to either of the 
target types of systems: 

Lemma 3. Consider any system of equations $\varphi(X_1, \ldots, X_m) = \psi(X_1, \ldots, X_m)$ and
inequalities $\varphi(X_1, \ldots, X_m) \subseteq \psi(X_1, \ldots, X_m)$ over sets of natural numbers that uses the
following operations: union; addition of a recursive constant; subtraction of a recursive
constant; intersection with a recursive constant. Assume that the system has a unique solution
$X_i = S_i \subseteq \mathbb{N}$. Then there exist:

(1) a system of equations over sets of natural numbers in variables
$X_1, \ldots, X_m, Y_1, \ldots, Y_{m'}$ using the operations of addition, subtraction and union
and singleton constants, which has a unique solution with $X_i = S_i$;

(2) a system of equations over sets of integers in variables $X_1, \ldots, X_m, Y_1, \ldots, Y_{m'}$ using
the operations of addition and union, singleton constants and the constants $\mathbb{N}$ and
$-\mathbb{N}$, which has a unique solution with $X_i = S_i$.

Inequalities $\varphi \subseteq \psi$ can be simulated by equations $\varphi \cup \psi = \psi$. For equations over sets
of natural numbers, each recursive constant is represented according to Proposition 1, and
this is sufficient to implement each addition or subtraction of a recursive constant by a large
subsystem using only singleton constants. In order to obtain a system over sets of integers,
a straightforward adaptation of Proposition 1 is needed:

Lemma 3.1. For every recursive set $S \subseteq \mathbb{N}$ there exists a system of equations over sets of
integers in variables $X_1, \ldots, X_n$ using union, addition, singleton constants and the constant $\mathbb{N}$,
such that the system has a unique solution with $X_1 = S$.

This is essentially the system given by Proposition 1 with additional equations $X_i \subseteq \mathbb{N}$.

Now a difference $X \dot{-} R$ for a recursive constant $R \subseteq \mathbb{N}$ shall be represented as
$(X + (-R)) \cap \mathbb{N}$, where the set $-R = \{-n \mid n \in R\}$ is specified by taking a system for $R$ and
applying the following transformation:

Lemma 3.2 (Representing sets of opposite numbers). Consider a system of equations over
sets of integers, in variables $X_1, \ldots, X_n$, using union and addition, and any constant sets,
which has a unique solution $X_i = S_i$. Then the same system, with each constant $C \subseteq \mathbb{Z}$
replaced by the set of the opposite numbers $-C$, has the unique solution $X_i = -S_i$.
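
One way to see why such a statement is plausible is that the map $n \mapsto -n$ commutes with both union and elementwise addition. The following Python sketch checks this identity on finite sets; the helper names are assumptions made for illustration, and the check is not a substitute for the proof of the lemma.

```python
def add(S, T):
    """Elementwise sum S + T = {m + n | m in S, n in T}."""
    return {m + n for m in S for n in T}


def neg(S):
    """Set of opposite numbers -S = {-n | n in S}."""
    return {-n for n in S}


def negation_commutes(S, T):
    """Check -(S + T) == (-S) + (-T) and -(S | T) == (-S) | (-T)."""
    return (neg(add(S, T)) == add(neg(S), neg(T))
            and neg(S | T) == neg(S) | neg(T))
```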

The last step in the proof of Lemma 3 is eliminating intersection with recursive
constants. This is done as follows:

Lemma 3.3 (Intersection with constants). Let $R \subseteq \mathbb{N}$ be a recursive set. Then there exists
a system of equations over sets of natural numbers using union, addition and singleton
constants, which has variables $X, Y, Y', Z_1, \ldots, Z_m$, such that the set of solutions of this
system is

$\{(X = S,\ Y = S \cap R,\ Y' = S \cap \overline{R},\ Z_i = S_i) \mid S \subseteq \mathbb{N}\}$,
where $S_1, \ldots, S_m$ are some fixed sets.

In plain words, the constructed system works as if it were an equation $Y = X \cap R$ (and also
another equation $Y' = X \cap \overline{R}$, which may be ignored). This completes the transformations
needed for Lemma 3.

The last basic element of the construction is representing a set of integers (both positive 
and negative) by first representing its positive and negative subsets individually: 

Lemma 4 (Assembling positive and negative subsets). Let sets $S \cap \mathbb{N}$ and $(-S) \cap \mathbb{N}$ be
representable by unique solutions of equations over sets of integers using union, addition,
and ultimately periodic constants. Then $S$ is representable by equations over integers using
only union, addition and ultimately periodic constants.
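
The set identity behind this lemma, $S = (S \cap \mathbb{N}) \cup -((-S) \cap \mathbb{N})$, can be illustrated on finite sets as follows. This is only a sketch of the identity with hypothetical helper names; it does not reproduce the construction of the equations themselves.

```python
def split_parts(S):
    """Return (S ∩ N, (-S) ∩ N) for a finite set of integers S."""
    positive = {n for n in S if n >= 0}
    mirrored_negative = {-n for n in S if n <= 0}
    return positive, mirrored_negative


def assemble(positive, mirrored_negative):
    """Rebuild S from S ∩ N and (-S) ∩ N."""
    return positive | {-n for n in mirrored_negative}
```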

3. Representing the arithmetical hierarchy 

Each arithmetical set can be represented by a recursive relation with a quantifier prefix,
and arithmetical sets form the arithmetical hierarchy based on the number of quantifier
alternations in such a formula. The bottom of the hierarchy are the recursive sets, and
every next level is comprised of two classes, $\Sigma^0_k$ and $\Pi^0_k$, which correspond to the cases of the
first quantifier being existential or universal. For every $k \geq 1$, a set is in $\Sigma^0_k$ if it can be
represented as

$\{w \mid \exists x_1 \forall x_2 \ldots Q_k x_k\; R(w, x_1, \ldots, x_k)\}$

for some recursive relation $R$, where $Q_k = \forall$ if $k$ is even and $Q_k = \exists$ if $k$ is odd. A set is
in $\Pi^0_k$ if it admits a similar representation with the quantifier prefix $\forall x_1 \exists x_2 \ldots Q_k x_k$. It is
easy to see that $\Pi^0_k = \{\overline{L} \mid L \in \Sigma^0_k\}$. The classes $\Sigma^0_1$ and $\Pi^0_1$ are the recursively enumerable
sets and their complements, respectively. The arithmetical hierarchy is known to be strict:
$\Sigma^0_k \subset \Sigma^0_{k+1}$ and $\Pi^0_k \subset \Pi^0_{k+1}$ for every $k \geq 0$. Furthermore, for every $k \geq 1$ the inclusion
$\Sigma^0_k \cup \Pi^0_k \subset \Sigma^0_{k+1} \cap \Pi^0_{k+1}$ is proper, i.e., there is a gap between the $k$-th and $(k+1)$-th level.
For this paper, the definition of arithmetical sets shall be arithmetized in base-7
notation^1 as follows: a set $S \subseteq \mathbb{N}$ is in $\Sigma^0_k$ if it is representable as

$S = \{(w)_7 \mid \exists x_1 \in \{3,6\}^*\ \forall x_2 \in \{3,6\}^* \ldots Q_k x_k \in \{3,6\}^* : (1x_k 1x_{k-1} \ldots 1x_1 1w)_7 \in R\}$,

for some recursive set $R \subseteq \mathbb{N}$, where $(w)_7$ for $w \in \{0, 1, \ldots, 6\}^*$ denotes the natural number
with base-7 notation $w$. The strings $x_i \in \{3,6\}^*$ represent binary notation of some numbers,
where 3 stands for zero and 6 stands for one. The notation $(x)_2$ for $x \in \{3,6\}^*$ shall be
used to denote the number represented by this encoding. The digits 1 act as separators.
Throughout this paper, the set of base-7 digits $\{0, 1, \ldots, 6\}$ shall be denoted by $\Omega_7$.

^1 Base 7 is the smallest base for which the details of the constructions could be conveniently implemented.
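
The two notations can be made concrete by a short Python sketch. The function names are illustrative assumptions, and so is the convention that the empty string denotes zero in both notations.

```python
def base7_value(w):
    """Natural number (w)_7 whose base-7 notation is the digit string w."""
    if any(c not in "0123456" for c in w):
        raise ValueError("not a string over {0,...,6}")
    return int(w, 7) if w else 0


def binary_value(x):
    """Number (x)_2 encoded by x in {3,6}*: digit 3 is zero, digit 6 is one."""
    if any(c not in "36" for c in x):
        raise ValueError("not a string over {3,6}")
    return int(x.replace("3", "0").replace("6", "1"), 2) if x else 0


def encode_binary(n):
    """Shortest string x over {3,6} with (x)_2 == n (empty string for 0)."""
    return format(n, "b").replace("0", "3").replace("1", "6") if n else ""
```

For example, the string 636 over $\{3,6\}$ stands for the binary string 101, that is, for the number 5, while $(163)_7 = 49 + 42 + 3 = 94$.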

In general, the construction of a system of equations representing the set $S$ begins with
representing $R$ and proceeds with evaluating the quantifiers, eliminating the prefixes $1x_k$,
$1x_{k-1}$, and so on until $1x_1$. In the end, all numbers $(1w)_7$ with $(w)_7 \in S$ will be produced.
These manipulations can be expressed in terms of the following three functions:

$Remove_1(X) = \{(w)_7 \mid (1w)_7 \in X\}$,

$E(X) = \{(1w)_7 \mid \exists x \in \{3,6\}^* : (x1w)_7 \in X\}$,

$A(X) = \{(1w)_7 \mid \forall x \in \{3,6\}^* : (x1w)_7 \in X\}$.
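
The intended behaviour of these functions can be sketched in Python on finite sets of numbers. All names below are hypothetical, and `A_bounded` is only a finite stand-in for $A(X)$: it lets $x$ range over strings of bounded length, since a true universal quantifier over $\{3,6\}^*$ cannot be evaluated on a finite set.

```python
from itertools import product


def to_base7(n):
    """Base-7 notation of a natural number n (empty string for 0)."""
    digits = ""
    while n > 0:
        n, r = divmod(n, 7)
        digits = str(r) + digits
    return digits


def val(s):
    """Number (s)_7 denoted by a base-7 digit string s."""
    return int(s, 7) if s else 0


def remove1(X):
    """Remove_1(X) = {(w)_7 | (1w)_7 in X}."""
    return {val(s[1:]) for s in map(to_base7, X) if s.startswith("1")}


def split_x1w(s):
    """Return (x, w) with s = x 1 w and x in {3,6}*, or None if impossible."""
    i = s.find("1")
    if i < 0 or any(c not in "36" for c in s[:i]):
        return None
    return s[:i], s[i + 1:]


def E(X):
    """E(X) = {(1w)_7 | exists x in {3,6}* with (x1w)_7 in X}."""
    parts = (split_x1w(to_base7(n)) for n in X)
    return {val("1" + p[1]) for p in parts if p is not None}


def A_bounded(X, max_len):
    """Finite stand-in for A(X): x ranges over {3,6}^k for k <= max_len only."""
    xs = ["".join(t) for k in range(max_len + 1)
          for t in product("36", repeat=k)]
    candidates = {s[1:] for s in map(to_base7, X) if s.startswith("1")}
    return {val("1" + w) for w in candidates
            if all(val(x + "1" + w) in X for x in xs)}
```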

The expression converting numbers of the form $(1w)_7$ to $(w)_7$ is constructed as follows:

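Lemma 5 (Removing leading digit 1). The value of the expression

$\big((X - \{1\}) \cap \{0\}\big) \cup \bigcup_{i \in \Omega_7 \setminus \{0\}} \bigcup_{t \in \{0,1\}} \big[\big(X \cap (1 i \Omega_7^t (\Omega_7^2)^*)_7\big) - (10^*)_7\big] \cap (i \Omega_7^t (\Omega_7^2)^*)_7$    (3.1)

on any $S \subseteq (1(\Omega_7^* \setminus 0\Omega_7^*))_7$ is $\{(w)_7 \mid (1w)_7 \in S\}$. The value on $S \subseteq (10\Omega_7^*)_7$ equals $\emptyset$.

With Lemma 5 established and the expression (3.1) proved to implement the function
$Remove_1(X)$, the notation $Remove_1(X)$ is used in equations to refer to this subexpression.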

Next, consider the function $E(X)$ representing the existential quantifier ranging over
strings in $\{3,6\}^*$. This function can be implemented by a single expression as follows:

Lemma E (Representing the existential quantifier). The value of the expression

$\big(X \cap (1\Omega_7^*)_7\big) \cup \Big(\big[\big(X \cap (\{3,6\}^+ 1 \Omega_7^*)_7\big) - (\{3,6\}^+ 0^*)_7\big] \cap (1\Omega_7^*)_7\Big)$

on any $S \subseteq (\{3,6\}^* 1 \Omega_7^*)_7$ is $E(S) = \{(1w)_7 \mid \exists x \in \{3,6\}^* : (x1w)_7 \in S\}$.

Note that $E(X)$ can already produce any recursively enumerable set from a recursive
argument, and therefore it is essential to use subtraction in the expression.
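
The arithmetic behind the subtraction step can be checked on single numbers: subtracting $(x0^{|w|+1})_7$ from $(x1w)_7$ leaves exactly $(1w)_7$, because the digits of $x$ occupy the same positions in both numbers. The Python sketch below, with hypothetical names, verifies only this one step; in the lemma, the subsequent intersection with $(1\Omega_7^*)_7$ is what discards the differences obtained from other constants.

```python
def val(s):
    """Number (s)_7 denoted by a base-7 digit string s."""
    return int(s, 7) if s else 0


def strip_prefix_by_subtraction(x, w):
    """Compute (x1w)_7 - (x0...0)_7, where the constant has |w| + 1 zeros."""
    number = val(x + "1" + w)
    constant = val(x + "0" * (len(w) + 1))  # an element of ({3,6}+ 0*)_7
    return number - constant
```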

With the existential quantifier implemented, the next task is to represent a universal
quantifier. Ideally, one would be looking for an expression implementing $A(X)$, but,
unfortunately, no such expression was found, and the actual construction given below implements
the universal quantifier using multiple equations. The first step is devising an equation
representing the function $f(X) = \{(x1w)_7 \mid x \in \{3,6\}^*, (1w)_7 \in X\}$, which prepends every
string of digits in $\{3,6\}^*$ to numbers in its argument set.

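Lemma 6. For every constant set $X \subseteq (1\Omega_7^*)_7$, the equation
$Y = X \cup Append_{3,6}(Y)$, where

$Append_{3,6}(Y) = \bigcup_{i,j \in \{3,6\}} \Big[\Big(\big[\big(Y \cap (j\Omega_7^*)_7\big) + (20^*)_7\big] \cap (2j\Omega_7^*)_7\Big) + ((i-2)0^*)_7\Big] \cap (ij\Omega_7^*)_7$

$\qquad \cup \bigcup_{i \in \{3,6\}} \big[\big(Y \cap (1\Omega_7^*)_7\big) + (i0^*)_7\big] \cap (i1\Omega_7^*)_7$

has the unique solution $Y = \{(x1w)_7 \mid x \in \{3,6\}^*, (1w)_7 \in X\}$.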

Lemma A (Representing the universal quantifier). Let $S, \widetilde{S} \subseteq (\{3,6\}^* 1 \Omega_7^*)_7$ be any sets,
such that $S \cap \widetilde{S} = \emptyset$ and for $x, x' \in \{3,6\}^*$, $(x1w)_7 \in S$ and $(x'1w)_7 \notin S$ implies
$(x'1w)_7 \in \widetilde{S}$. Then the following system of equations over sets of integers in variables $Y$, $\widetilde{Y}$ and $Z$

$Y = Z \cup Append_{3,6}(Y)$
$\widetilde{Y} = E(\widetilde{S}) \cup Append_{3,6}(\widetilde{Y})$
$Z \subseteq (1\Omega_7^+)_7$
$Y \subseteq S \subseteq Y \cup \widetilde{Y}$

has the unique solution $Z = A(S) = \{(1w)_7 \mid \forall x \in \{3,6\}^* : (x1w)_7 \in S\}$,
$Y = \{(y1w)_7 \mid y \in \{3,6\}^*, \forall x \in \{3,6\}^* : (x1w)_7 \in S\}$,
$\widetilde{Y} = \{(y1w)_7 \mid y \in \{3,6\}^*, \exists x \in \{3,6\}^* : (x1w)_7 \in \widetilde{S}\}$.

Once the above quantifiers process a number $(1x_k \ldots 1x_1 1w)_7$, reducing it to
$(1w)_7$, the actual number $(w)_7$ is obtained from this encoding by Lemma 5.

Theorem 1. Every arithmetical set $S \subseteq \mathbb{Z}$ ($S \subseteq \mathbb{N}$) is representable as a component of
a unique solution of a system of equations over sets of integers (sets of natural numbers,
respectively) using the operations of addition and union and ultimately periodic
constants (addition, subtraction, union and singleton constants, respectively).

4. Representing hyper-arithmetical sets 

Following Moschovakis [14, Sec. 8E] and Aczel [1, Thm. 2.2.3], hyper-arithmetical sets
$B_1, B_2, \ldots$ shall be defined as the smallest effective $\sigma$-ring, which is the recursion-theoretic
counterpart to Borel sets (the smallest family of sets containing all open sets and closed
under countable union and countable intersection).

Let $f_1, f_2, \ldots$ be an enumeration of all partial recursive functions and let $\tau_1$, $\tau_2$ be two
recursive functions. Then, for all $k \in \mathbb{N}$,

$B_{\tau_1(k)} = \mathbb{N} \setminus \{k\}, \qquad C_{\tau_1(k)} = \{k\}.$

Moreover, for all numbers $k \in \mathbb{N}$, if $f_k$ is a total function, then

$B_{\tau_2(k)} = \bigcup_{n \in \mathbb{N}} C_{f_k(n)}, \qquad C_{\tau_2(k)} = \bigcap_{n \in \mathbb{N}} B_{f_k(n)},$

where the former operation is known as effective $\sigma$-union, while the latter is effective
$\sigma$-intersection. Note that the only distinction between $B_e$ and $C_e$ is that the former is defined
as a union and the latter as an intersection. As the definitions are dual, $B_e = \overline{C_e}$.

The family of sets $\mathcal{B} = \{B_e, C_e \mid e \in I\}$, where $I \subseteq \mathbb{N}$ is an index set, is called an
effective $\sigma$-ring, if it contains $\{B_{\tau_1(e)}, C_{\tau_1(e)} \mid e \in \mathbb{N}\}$ and is closed under effective $\sigma$-union
and effective $\sigma$-intersection. Then the hyper-arithmetical sets are defined as the smallest
effective $\sigma$-ring, which can be formally defined as the least fixed point of a certain operator
on the set $\mathcal{A} = 2^{\mathbb{N} \times 2^{\mathbb{N}} \times 2^{\mathbb{N}}}$, where a triple $(e, B_e, C_e)$ indicates that the sets $B_e$ and $C_e$ have
been defined for the index $e$ in the above inductive definition, and the operator, mapping $\mathcal{A}$ to $\mathcal{A}$,
represents one step of this inductive definition. Furthermore, this least fixed point can be
obtained constructively by a transfinite induction on countable ordinals, which is essential
for any proofs about hyper-arithmetical sets. It is known [14, Sec. 8E], [1, Thm. 2.2.3]
that for some (easy) choices of $\tau_1$ and $\tau_2$ the smallest effective $\sigma$-ring coincides with the $\Delta^1_1$ sets.
Fix those two functions and the corresponding $\mathcal{B}$. Note that the definition is not valid for
every choice of $\tau_1$ and $\tau_2$: in particular, they must be one-to-one and have disjoint images.

With every set $B_e \in \mathcal{B}$ one can associate a tree of $B_e$, labelled with sets from $\mathcal{B}$: its root
is labelled with $B_e$, and each vertex $B_{\tau_2(e')}$ ($C_{\tau_2(e')}$, respectively) in the tree has children
labelled with $\{C_{f_{e'}(n)} \mid n \in \mathbb{N}\}$ ($\{B_{f_{e'}(n)} \mid n \in \mathbb{N}\}$, respectively). Vertices of the form $B_{\tau_1(e')}$
or $C_{\tau_1(e')}$ have no children; these are the only leaves in the tree.

A partial order $\prec$ is well-founded if it has no infinite descending chain. Extending this
notion to oriented trees, a tree is well-founded if it contains no infinite downward path.

Lemma 7. For each pair of sets $B_e, C_e \in \mathcal{B}$ the trees of $B_e$, $C_e$ are well-founded.

The well-foundedness of an order allows using the well-founded induction principle: given
a property $\phi$ and a well-founded order $\prec$ on a set $A$, $\phi(n)$ is true for all $n \in A$ if, for every $n \in A$,

$(\forall m \prec n : \phi(m)) \Rightarrow \phi(n)$.

This principle shall be used in the proof of the main construction, which is described in the
rest of this section. Note that the basis of the induction are the $\prec$-minimal elements $n$ of $A$,
as for them $\phi(n)$ has to be shown directly.

Fix $B_{i_0}$ as the target set in the root. Consider a path of length $k$ in this tree, going
from $B_{i_0}$ to $C_{i_1}, B_{i_2}, \ldots, B_{i_k}$ (or $C_{i_k}$, depending on the parity of $k$). Then, for each $j$-th
set in this path, $i_j = f_{\tau_2^{-1}(i_{j-1})}(n_j)$ for some number $n_j$, and the path is uniquely defined by
the sequence of numbers $n_1, \ldots, n_k$. Consider the binary encoding of each of these numbers
written using digits 3 and 6 (representing zero and one, respectively), and let $Resolve$ be a
partial function that maps finite sequences of such "binary" strings representing numbers
$n_1, \ldots, n_k$ to the number $i_k$ of the set $B_{i_k}$ or $C_{i_k}$ at the end of this path. The value of this
function can be formally defined by induction:

$Resolve(\langle\rangle) = i_0, \qquad Resolve(x_1, \ldots, x_k) = f_{\tau_2^{-1}(Resolve(x_1, \ldots, x_{k-1}))}((x_k)_2).$

Note that $Resolve$ may be undefined if some $\tau_2$-preimage is undefined.
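
The inductive definition can be read as a simple loop. The Python sketch below is illustrative only: `f` and `tau2_inv` are caller-supplied stand-ins for the enumeration $f_e$ and for the partial inverse of $\tau_2$, the value `None` models "undefined", and none of these names or conventions come from the paper.

```python
def binary_value(x):
    """Number (x)_2 encoded by x in {3,6}* (3 is zero, 6 is one)."""
    return int(x.replace("3", "0").replace("6", "1"), 2) if x else 0


def resolve(xs, i0, f, tau2_inv):
    """Follow the path x_1, ..., x_k starting from index i0.

    f(e, n) plays the role of f_e(n); tau2_inv(i) returns e with
    tau_2(e) == i, or None if i is not in the image of tau_2.
    Returns the index at the end of the path, or None if undefined.
    """
    index = i0
    for x in xs:
        e = tau2_inv(index)
        if e is None:
            return None  # tau_2-preimage undefined, e.g. at a leaf
        index = f(e, binary_value(x))
        if index is None:
            return None  # f_e undefined on this argument
    return index


# Toy instance (an assumption): tau_1(e) = 2e and tau_2(e) = 2e + 1,
# which are one-to-one with disjoint images, and f_e(n) = e + 2n.
def toy_tau2_inv(i):
    return (i - 1) // 2 if i % 2 == 1 else None


def toy_f(e, n):
    return e + 2 * n
```

In the toy instance, starting from index 7, the path encoded by the strings 6 and 63 leads first to index 5 and then to index 6, which is a leaf, so any longer path is undefined.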

The goal is to construct a system of equations, such that the following two sets are
among the components of its unique solution:

$Goal_0 = \{(1x_k 1x_{k-1} \ldots 1x_1 10w)_7 \mid k \geq 0, x_i \in \{3,6\}^*, (w)_7 \in B_{Resolve(x_1, \ldots, x_k)}\}$,

$Goal_1 = \{(1x_k 1x_{k-1} \ldots 1x_1 10w)_7 \mid k \geq 0, x_i \in \{3,6\}^*, (w)_7 \in C_{Resolve(x_1, \ldots, x_k)}\}$.

These sets encode the sets $B_0, B_1, \ldots$ needed to compute $B_{i_0}$. In this way the (possibly
infinite) number of equations defining sets in the hyper-arithmetical hierarchy is encoded
in a finite number of equations using only a small number of variables. The set $B_i$ in
the node with path to the root encoded by $x_k, x_{k-1}, \ldots, x_1 \in \{3,6\}^*$ is represented by
$\{(1x_k \ldots 1x_1 10w)_7 \mid (w)_7 \in B_i\} \subseteq Goal_0$.

The following set defines the admissible encodings, that is, numbers encoding paths in
the tree of $B_{i_0}$:

$Admissible = \{(1x_k 1x_{k-1} \ldots 1x_1 10w)_7 \mid k \geq 0, x_i \in \{3,6\}^*, Resolve(x_1, \ldots, x_k) \text{ is defined}\}$

The next two sets represent the leaves of the tree of $B_{i_0}$, and the numbers in those leaves:

$R_0 = \{(1x_k 1x_{k-1} \ldots 1x_1 10w)_7 \mid k \geq 0, x_i \in \{3,6\}^*, \exists e \in \mathbb{N} : Resolve(x_1, \ldots, x_k) = \tau_1(e), (w)_7 \in B_{\tau_1(e)}\}$,

$R_1 = \{(1x_k 1x_{k-1} \ldots 1x_1 10w)_7 \mid k \geq 0, x_i \in \{3,6\}^*, \exists e \in \mathbb{N} : Resolve(x_1, \ldots, x_k) = \tau_1(e), (w)_7 \in C_{\tau_1(e)}\}$.

Lemma 8. The sets Goali, Admissible, Ri are r.e. sets, Resolve is an r.e. predicate. 



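Consider the following system of equations:

$X_0 = E(Remove_1(X_1)) \cup R_0$    (4.1)
$X_1 = Z \cup R_1$    (4.2)
$\widetilde{Y} = E(Remove_1(X_1)) \cup Append_{3,6}(\widetilde{Y})$    (4.3)
$Y = Z \cup Append_{3,6}(Y)$    (4.4)
$Y \subseteq Remove_1(X_0 \cap Admissible) \subseteq Y \cup \widetilde{Y}$    (4.5)
$Z \subseteq (1\Omega_7^+)_7$    (4.6)
$X_0, X_1 \subseteq Admissible$    (4.7)
$X_0 \cap R_1 = X_1 \cap R_0 = \emptyset$    (4.8)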



Its intended unique solution has $X_0 = Goal_0$ and $X_1 = Goal_1$, and accordingly encodes
the set $B_{i_0}$, as well as all sets of $\mathcal{B}$ on which $B_{i_0}$ logically depends. The system implements
the functions $E(X)$ and $A(X)$ to represent effective $\sigma$-union and $\sigma$-intersection, respectively.
For that purpose, the expression for $E(X)$ introduced in Lemma E, as well as the system
of equations implementing $A(X)$ defined in Lemma A, are applied iteratively to the same
variables $X_0$ and $X_1$. Intuitively, the above system may be regarded as an implementation
of an equation $X_0 = A(E(X_0)) \cup \mathrm{const}$.

The proof uses the principle of induction on well-founded structures. The membership
of numbers of the form $(1x_k 1x_{k-1} \ldots 1x_1 10w)_7$ in the variables $X_0$ and $X_1$, where $k \geq 0$,
$x_i \in \{3,6\}^*$ and $w \in \Omega_7^* \setminus 0\Omega_7^*$, is first proved for larger $k$'s and then inductively extended
down to $k = 0$, which allows extracting $B_{i_0}$ out of the solution. The well-foundedness of the
tree of $B_{i_0}$ means that although $B_{i_0}$ depends upon infinitely many sets, each dependency is
over a finite path ending with a constant, that is, the self-dependence of numbers in $X_0, X_1$
on the numbers in $X_0, X_1$ reaches a constant $R_0, R_1$ in finitely many steps (yet the number
of steps is unbounded).

Lemma 9. The unique solution of the system (4.1)-(4.8) is

$X_0 = Goal_0 = \{(1x_k \ldots 1x_1 10w)_7 \mid k \geq 0, x_i \in \{3,6\}^*, (w)_7 \in B_{Resolve(x_1, \ldots, x_k)}\}$
$X_1 = Goal_1 = \{(1x_k \ldots 1x_1 10w)_7 \mid k \geq 0, x_i \in \{3,6\}^*, (w)_7 \in C_{Resolve(x_1, \ldots, x_k)}\}$
$Y = \{(x_{k+1} 1x_k \ldots 1x_1 10w)_7 \mid k \geq 0, x_i \in \{3,6\}^*, \forall x_{k+1} : (w)_7 \in B_{Resolve(x_1, \ldots, x_{k+1})}\}$
$\widetilde{Y} = \{(x_{k+1} 1x_k \ldots 1x_1 10w)_7 \mid k \geq 0, x_i \in \{3,6\}^*, \exists x_{k+1} : (w)_7 \in C_{Resolve(x_1, \ldots, x_{k+1})}\}$
$Z = Goal_1 \setminus R_1 = \{(1x_k \ldots 1x_1 10w)_7 \mid k \geq 0, e \in \mathbb{N}, x_i \in \{3,6\}^*, Resolve(x_1, \ldots, x_k) = \tau_2(e), (w)_7 \in C_{\tau_2(e)}\}$

Then, in order to obtain the set $B_{i_0}$, it remains to intersect $X_0 = Goal_0$ with the
recursive constant set $(10\Omega_7^*)_7$, and then remove the leading digits 10 by a construction
analogous to the one in Lemma 5.

Theorem 2. For every hyper-arithmetical set $B \subseteq \mathbb{Z}$ ($B \subseteq \mathbb{N}$) there is a system of equations
over subsets of $\mathbb{Z}$ (over subsets of $\mathbb{N}$, respectively) using union, addition and ultimately
periodic constants (union, addition, subtraction and singleton constants, respectively), such
that $(B, \ldots)$ is its unique solution.

5. Equations with addition only 

Equations over sets of natural numbers with addition as the only operation can represent
an encoding of every recursive set, with each number $n \in \mathbb{N}$ represented by the number
$16n + 13$ in the encoding [9]. In order to define this encoding, for each $i \in \{0, 1, \ldots, 15\}$
and for every set $S \subseteq \mathbb{Z}$, denote:

$\tau_i(S) = \{16n + i \mid n \in S\}$.

The encoding of a set of natural numbers $S \subseteq \mathbb{N}$ is defined as

$\sigma_0(S) = \{0\} \cup \tau_6(\mathbb{N}) \cup \tau_8(\mathbb{N}) \cup \tau_9(\mathbb{N}) \cup \tau_{12}(\mathbb{N}) \cup \tau_{13}(S)$.

Proposition 2 ([9, Thm. 5.3]). For every recursive set $S$ there exists a system of equations
over sets of natural numbers in variables $X, Y_1, \ldots, Y_m$ using the operation of addition and
ultimately periodic constants, which has a unique solution with $X = \sigma_0(S)$.

This result is proved by first representing the set $S$ by a system with addition and union,
and then by representing addition and union of sets using addition of their $\sigma_0$-encodings.

The purpose of this section is to obtain a similar result for equations over sets of integers:
namely, that they can represent the same kind of encoding of every hyper-arithmetical set.
For every set $S \subseteq \mathbb{Z}$, define its encoding as the set

$\sigma(S) = \{0\} \cup \tau_6(\mathbb{Z}) \cup \tau_8(\mathbb{Z}) \cup \tau_9(\mathbb{Z}) \cup \tau_{12}(\mathbb{Z}) \cup \tau_{13}(S)$.

The subset $\sigma(S) \cap \{16n + i \mid n \in \mathbb{Z}\}$ is called the $i$-th track of $\sigma(S)$.
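
Membership in the encoding depends only on the residue modulo 16 and, on track 13, on the encoded set. A minimal Python sketch, with hypothetical names and with sets given by membership predicates, could read as follows.

```python
FULL_TRACKS = (6, 8, 9, 12)  # tracks that contain every number of their residue
DATA_TRACK = 13              # track carrying the encoded set S


def sigma_member(m, S_member):
    """Decide whether m belongs to sigma(S), given a membership test for S."""
    n, i = divmod(m, 16)     # m = 16 n + i with 0 <= i < 16, also for m < 0
    if m == 0 or i in FULL_TRACKS:
        return True
    return i == DATA_TRACK and S_member(n)


def decode(encoded_member, n):
    """Recover 'n in S' from the encoding: n in S iff 16 n + 13 in sigma(S)."""
    return encoded_member(16 * n + DATA_TRACK)
```

Python's `divmod` rounds towards minus infinity, so negative numbers are assigned to tracks in the same way as positive ones, for example $-3 = 16 \cdot (-1) + 13$.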

The first result on this encoding is that the condition of a set $X$ being an encoding of
any set can be specified by an equation of the form $X + C = D$.

Lemma 10 (cf. [9, Lemma 3.3]). A set $X \subseteq \mathbb{Z}$ satisfies the equation

$X + \{0, 4, 11\} = \bigcup_{i \in \{0,1,3,4,6,7,8,9,10,12,13\}} \tau_i(\mathbb{Z}) \cup \{11\}$

if and only if $X = \sigma(S)$ for some $S \subseteq \mathbb{Z}$.
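
The "if" direction of this equation can be tested numerically on a bounded window of integers. The Python sketch below uses hypothetical names and is a finite sanity check under the stated window assumption, not a proof: it builds $\sigma(S)$ on $[-L, L]$ and compares both sides on $[-L + 11, L]$, where no sum is lost at the edges.

```python
FULL = {6, 8, 9, 12}
RHS_TRACKS = {0, 1, 3, 4, 6, 7, 8, 9, 10, 12, 13}


def sigma_window(S, L):
    """sigma(S) restricted to the window [-L, L], for a finite set S."""
    return {m for m in range(-L, L + 1)
            if m == 0 or m % 16 in FULL or (m % 16 == 13 and m // 16 in S)}


def lemma10_holds_on_window(X, L):
    """Compare X + {0, 4, 11} with the right-hand side on [-L + 11, L].

    X must be given on [-L, L]. Inside the smaller window every sum x + c
    with c in {0, 4, 11} has x in [-L, L], so no edge effects occur.
    """
    lhs = {x + c for x in X for c in (0, 4, 11)}
    for m in range(-L + 11, L + 1):
        expected = (m % 16 in RHS_TRACKS) or m == 11
        if (m in lhs) != expected:
            return False
    return True
```

Removing the number 0 from an encoding makes the check fail, because 11 can only be obtained as $0 + 11$.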

Now, assuming that the given system of equations with union and addition is
decomposed to have all equations of the form $X = Y + Z$, $X = Y \cup Z$ or $X = \mathrm{const}$, these
equations can be simulated in a new system as follows:

Lemma 11 (cf. [9, Lemma 4.1]). For all sets $X, Y, Z \subseteq \mathbb{Z}$,

$\sigma(Y) + \sigma(Z) + \{0, 1\} = \sigma(X) + \sigma(\{0\}) + \{0, 1\}$ if and only if $Y + Z = X$,
$\sigma(Y) + \sigma(Z) + \{0, 2\} = \sigma(X) + \sigma(X) + \{0, 2\}$ if and only if $Y \cup Z = X$.

Using these two lemmata, one can simulate any system with addition and union by a 
system with addition only. Taking systems representing different hyper-arithmetical sets, 
the following result on the expressive power of systems with addition can be established: 

Theorem 3. For every hyper-arithmetical set $S \subseteq \mathbb{Z}$ there exists a system of equations over
sets of integers using the operation of addition and ultimately periodic constants, which has
a unique solution with $X_1 = T$, where $S = \{n \mid 16n \in T\}$.

                                             Sets representable                     Complexity of decision problems
Equations                                    by unique solutions                    solution existence         solution uniqueness

over $2^{\mathbb{N}}$, with $\{+, \cup\}$             $\Delta^0_1$ (recursive) [8]           $\Pi^0_1$-complete [8]     $\Pi^0_2$-complete [8]
over $2^{\mathbb{N}}$, with $\{+\}$                   encodings of $\Delta^0_1$ [9]          $\Pi^0_1$-complete [9]     $\Pi^0_2$-complete [9]

over $2^{\mathbb{N}}$, with $\{+, \dot{-}, \cup\}$    $\Delta^1_1$ (hyper-arithmetical)      $\Sigma^1_1$-complete      $\Pi^1_1 \leq \cdot \leq \Delta^1_2$
over $2^{\mathbb{Z}}$, with $\{+, \cup\}$             $\Delta^1_1$ (hyper-arithmetical)      $\Sigma^1_1$-complete      $\Pi^1_1 \leq \cdot \leq \Delta^1_2$
over $2^{\mathbb{Z}}$, with $\{+\}$                   encodings of $\Delta^1_1$              $\Sigma^1_1$-complete      $\Pi^1_1 \leq \cdot \leq \Delta^1_2$

Table 1: Summary of the results.

6. Decision problems 

Having a solution (solution existence) and having exactly one solution (solution
uniqueness) are basic properties of a system of equations. For language equations with continuous
operations, solution existence is $\Pi^0_1$-complete [19], and it remains $\Pi^0_1$-complete already in the
case of a unary alphabet, concatenation as the only operation and regular constants [9], that
is, for equations over sets of natural numbers with addition only. For the same formalisms,
solution uniqueness is $\Pi^0_2$-complete.

Consider equations over sets of integers. Since their expressive power extends beyond
the arithmetical hierarchy, the decision problems should accordingly be harder. In fact,
solution existence is $\Sigma^1_1$-complete, which will now be proved using a reduction from the
following problem:

Proposition 3 (Rogers [21, Thm. 16-XX]). Consider trees with nodes labelled by finite
sequences of natural numbers, such that a node $(x_1, \ldots, x_{k-1}, x_k)$ is a son of $(x_1, \ldots, x_{k-1})$,
and the empty sequence $\varepsilon$ is the root. Then the following problem is $\Pi^1_1$-complete: "Given
a description of a Turing machine recognizing the set of nodes of a certain tree, determine
whether this tree has no infinite paths".

In other words, a given Turing machine recognizes sequences of natural numbers, and
the task is to determine whether there is no infinite sequence of natural numbers, such that
all of its prefixes are accepted by the machine. The $\Sigma^1_1$-complete complement of the problem
is testing whether such an infinite sequence exists, and it can be reformulated as follows:

Corollary 1. The following problem is $\Sigma^1_1$-complete: "Given a Turing machine $M$ working
on natural numbers, determine whether there exists an infinite sequence of strings $\{x_i\}_{i=1}^{\infty}$
with $x_i \in \{3,6\}^*$, such that $M$ accepts $(1x_k 1x_{k-1} \ldots 1x_1 1)_7$ for all $k \geq 0$".
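
The shape of this problem can be illustrated by a bounded search in Python. The sketch is a deliberately simplified assumption: the acceptor receives the base-7 digit string instead of the number, paths are explored only up to a fixed depth, and blocks are limited in length, so the search cannot decide the $\Sigma^1_1$-complete problem itself. All names are hypothetical.

```python
from itertools import product


def blocks(max_len):
    """All strings over {3,6} of length at most max_len, shortest first."""
    return ["".join(t) for k in range(max_len + 1)
            for t in product("36", repeat=k)]


def encode_path(xs):
    """Digit string 1 x_k 1 x_{k-1} ... 1 x_1 1 for the path x_1, ..., x_k."""
    return "".join("1" + x for x in reversed(xs)) + "1"


def find_path(accepts, depth, max_len, prefix=()):
    """Depth-first search for x_1..x_depth such that every prefix is accepted.

    Returns the list of blocks found, or None if no path exists within
    the given bounds.
    """
    if not accepts(encode_path(prefix)):
        return None
    if len(prefix) == depth:
        return list(prefix)
    for x in blocks(max_len):
        found = find_path(accepts, depth, max_len, prefix + (x,))
        if found is not None:
            return found
    return None


def toy_accepts(s):
    """Toy acceptor: every block must be non-empty and start with digit 6."""
    return all(b.startswith("6") for b in s.split("1")[1:-1])
```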

This problem can be reduced to testing existence of a solution of equations over sets of 
numbers. 

Theorem 4. The problem of whether a given system of equations over sets of integers with
addition and ultimately periodic constants has a solution is $\Sigma^1_1$-complete.

Now consider the solution uniqueness property. The following upper bound on its 
complexity naturally follows by definition: 

Theorem 5. The problem of whether a given system of equations over sets of integers
using addition and ultimately periodic constants has a unique solution can be represented
as a conjunction of a $\Sigma^1_1$-formula and a $\Pi^1_1$-formula, and is accordingly in $\Delta^1_2$. At the same
time, the problem is $\Pi^1_1$-hard.

The exact hardness of testing solution uniqueness is still open. The properties of
different families of equations over sets of numbers are summarized in Table 1.